CDS

Accession Number TCMCG078C08268
gbkey CDS
Protein Id KAG0461480.1
Location join(32633531..32633751,32640217..32640432,32640526..32640637,32641131..32641193,32641280..32641418,32642237..32642292,32642567..32642640,32642814..32642961,32643118..32643291,32643489..32643576,32643672..32643788,32644160..32644257,32644366..32644483,32644557..32644651,32645167..32645273,32645348..32645441)
Organism Vanilla planifolia
locus_tag HPP92_021777
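
The Location field above lists 16 exon segments inside a `join(...)`. A minimal sketch of how to check it against the protein length, assuming 1-based inclusive coordinates (the usual GenBank feature-table convention); the helper names `parse_join` and `spliced_length` are illustrative and not part of this record:

```python
import re

# Location string copied from the record above.
LOCATION = (
    "join(32633531..32633751,32640217..32640432,32640526..32640637,"
    "32641131..32641193,32641280..32641418,32642237..32642292,"
    "32642567..32642640,32642814..32642961,32643118..32643291,"
    "32643489..32643576,32643672..32643788,32644160..32644257,"
    "32644366..32644483,32644557..32644651,32645167..32645273,"
    "32645348..32645441)"
)

def parse_join(location):
    """Return a list of (start, end) integer pairs from a join(...) string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def spliced_length(segments):
    """Total length in nucleotides, assuming 1-based inclusive coordinates."""
    return sum(end - start + 1 for start, end in segments)

segments = parse_join(LOCATION)
print(len(segments))             # 16 segments
print(spliced_length(segments))  # 1920 nt = 3 * (639 codons + 1 stop codon)
```

The spliced length of 1920 nt agrees with the 639 aa protein plus one stop codon.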

Protein

Length 639 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA633886, BioSample:SAMN14973820
db_source JADCNL010000011.1
Definition hypothetical protein HPP92_021777 [Vanilla planifolia]
Locus_tag HPP92_021777

EGGNOG-MAPPER Annotation

COG_category F
Description AIR carboxylase
KEGG_TC -
KEGG_Module M00048
KEGG_Reaction R04209
KEGG_rclass RC00590
BRITE ko00000, ko00001, ko00002, ko01000
KEGG_ko ko:K11808
EC 4.1.1.21
KEGG_Pathway ko00230, ko01100, ko01110, map00230, map01100, map01110
GOs -

Sequence

CDS:  
ATGCTGCTTGGCTCTGCTCCACTTTCCACCTTTGTCCCATCGAGCAATCGCCATCTCTCAGACCTCGGCGGCCATTCTTTATTCTTTCTCATAGGACGAACCGGCGCTGTTGGGCGGATCCGTCTCCGGCAGCAGAGCATGGCATCTTCACCCTCGTTGACAAGGTCATTCCGATTGCAAGCCACGATGCAAACAGACAATCATGTTTCTTCCTCACAGCGGGATGAGTGCTCTACTATCCATGGAGTTTCTAGGACCATTGTCGGTGTTTTGGGAGGAGGTCAGTTGGGTAGAATGCTATGTCAAGCTGCTAACATCATGTCTGTTAAGGTCATGATATTGGATCCCCTAGAAAATTGCCCAGCAAGCCCTCTTTCATATCAGCATTTTGTTGGCAGATTTGACGATGGTGATGCTGTCCGTGAGTTTGCAAAGAGATGTGGAGTGTTAACTATAGAGATTGAACATGTTGATGCTGTTACACTAGAAAAATTGGAGCAACAAGGCGTTGATATCCAGCCAAAACCATCCACGATAAGAATAATACAGGACAAATATATTCAGAAGGTTCATTTTTCTCAACATGGGATTCCACTTCCTGATTTCATAAAGATAGATAACCTTGAAGGTGCTGAAAAAGCAGGCGAGCTATTTGGTTATCCTTTGATGGTTAAAAGTAGAAGGCATGCATACGATGGGCGTGGAAATGCTGTCGCAGATTGTAAAGAAAAGCTTACTTCGGCTATTGGAGCATTGGGAGGATATGAATGCGGCTTATATGTTGAGAAGTGGACTTCATTTGCAAAGGAGCTTTCAGTCATCGTTGCAAGGGGGAGGGATGGTTCAGTTTTATGCTATCCTGTAGTAGAAACTATTCATAAAGAGAACATTTGCCATATTGTTGAAGCTCCTGCCGATGTACCTGAAACAATAAAGAAGCTTTCTATTGATGTTGCTACTAGAGCTGTTGGTTCACTAGCGGGAGCAGGAGTATTTGCTGTAGAGTTATTTTTAACACATAATGGACAAGTTTTGCTGAATGAAGTAGCTCCAAGACCACACAATAGTGGACATCACACGATCGAATCTTGTTATACATCTCAATATGAACAGCACTTAAGAGCTATTCTAGGTCTTCCACTAGGCAACACATCCATGAAGGTCTCGGCTGCCATCATGTACAACATACTTGGTGAGGATGAGGGAGAGCAAGGCTTCCATTTAGCTCATCAAATTATGAGAAGAGCATTGAGCATTCCTGGAGCTTCAGTTCATTGGTATGACAAACCAGAAATTCGGAAGCTACGAAAAATGGGGCATGTCACGATTGTTGGCCCTTCAAAGAGCTATGTCAAGAACAACTTGCGATCAATGTTGGATGGAGAAACCACTGAAAGCCATGTTTCAGATACTCCTCAAGTTTCCATAATCATGGGCTCCGATTCCGATCTTCCTACGATGAAGGATGCCGCAGAAATCTTCAGGAATTTTCATGTGCCATTTGAGATGACAATTGTTTCGGCACATCGAACACCCGAAAGGATGTATTCTTTTGCGTTGTCTGCGAAAGAAAGGGGCATTCGGATAATCATAGCCGGTGCTGGTGGTGCTGCTCATTTACCAGGTATGGTGGCTTCATTGACTCCTTTGCCTGTTATAGGAGTTCCAATTAGGACCTCTTCTTTAGATGGGTTTGATTCACTGTTGTCTATTGTGCAGATGCCAAAAGGTATACCAGTTGCAACAGTTGCAATAGGAAATGCAGCAAATGCTGCCCTTCTTGCAATCAGAATTCTAGCAACCAGTGATGATGAACTATGGGAAAGAGTGAAGAACTACCAAGAAGAACTGAAGGATACTGTTTTGAAGAAGGCAGAAAAGTTAGAGGGAGAAGGTTGGGAGAGATATTTAAATCCTTGA
Protein:  
MLLGSAPLSTFVPSSNRHLSDLGGHSLFFLIGRTGAVGRIRLRQQSMASSPSLTRSFRLQATMQTDNHVSSSQRDECSTIHGVSRTIVGVLGGGQLGRMLCQAANIMSVKVMILDPLENCPASPLSYQHFVGRFDDGDAVREFAKRCGVLTIEIEHVDAVTLEKLEQQGVDIQPKPSTIRIIQDKYIQKVHFSQHGIPLPDFIKIDNLEGAEKAGELFGYPLMVKSRRHAYDGRGNAVADCKEKLTSAIGALGGYECGLYVEKWTSFAKELSVIVARGRDGSVLCYPVVETIHKENICHIVEAPADVPETIKKLSIDVATRAVGSLAGAGVFAVELFLTHNGQVLLNEVAPRPHNSGHHTIESCYTSQYEQHLRAILGLPLGNTSMKVSAAIMYNILGEDEGEQGFHLAHQIMRRALSIPGASVHWYDKPEIRKLRKMGHVTIVGPSKSYVKNNLRSMLDGETTESHVSDTPQVSIIMGSDSDLPTMKDAAEIFRNFHVPFEMTIVSAHRTPERMYSFALSAKERGIRIIIAGAGGAAHLPGMVASLTPLPVIGVPIRTSSLDGFDSLLSIVQMPKGIPVATVAIGNAANAALLAIRILATSDDELWERVKNYQEELKDTVLKKAEKLEGEGWERYLNP
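
The CDS and Protein sequences above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly; the names `BASES`, `AMINO_ACIDS`, `CODON_TABLE`, and `translate` are illustrative. Only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a nucleotide string codon by codon; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First five and last three codons of the CDS shown above.
print(translate("ATGCTGCTTGGCTCT"))  # MLLGS
print(translate("AATCCTTGA"))        # NP*
```

Both fragments match the start (`MLLGS`) and end (`NP`, followed by the stop codon) of the Protein sequence.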